To automatically map source code entities to architectural modules with Naive Bayes

Abstract

The process of mapping a source code entity onto an architectural module is to a large degree a manual task. Automating this process could increase the use of static architecture conformance checking methods, such as reflexion modeling, in industry. Current techniques rely on user parameterization and a highly cohesive design. A machine learning approach would potentially require fewer parameters and make better use of the available information to aid automatic mapping. We investigate how a classifier can be trained to map from source code to architectural modules automatically. This classifier is trained with semantic and syntactic dependency information extracted from the source code and from architecture descriptions. The classifier is implemented using multinomial naive Bayes and evaluated. We perform experiments and compare the classifier with three state-of-the-art mapping functions in eight open-source Java systems with known ground-truth mappings. We find that the classifier outperforms the state-of-the-art in all cases and that it provides a useful baseline for further research in the area of semi-automatic incremental clustering. We conclude that machine learning is a useful approach that performs better and with less need for parameterization compared to other approaches. Future work includes investigating problematic mappings and a more diverse set of subject systems.
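To make the idea of the abstract concrete, the following is a minimal sketch of a multinomial naive Bayes classifier that assigns an unmapped entity to a module based on a bag of tokens. It is not the authors' implementation: the class name `MultinomialNB`, the toy modules (`gui`, `model`, `storage`), the training entities, and the `dep:` prefix used to encode a dependency as a token are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


class MultinomialNB:
    """Multinomial naive Bayes over bags of tokens, with additive smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # smoothing constant (1.0 = Laplace smoothing)

    def fit(self, docs, labels):
        # Count how many training entities belong to each module, and how
        # often each token occurs in the entities of each module.
        self.class_counts = Counter(labels)
        self.token_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.token_counts[label].update(tokens)
        self.vocab = {t for c in self.token_counts.values() for t in c}
        return self

    def log_scores(self, tokens):
        n_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, n in self.class_counts.items():
            counts = self.token_counts[label]
            total = sum(counts.values())
            score = math.log(n / n_docs)  # log prior of the module
            for t in tokens:
                if t in self.vocab:  # tokens never seen in training are ignored
                    score += math.log(
                        (counts[t] + self.alpha) / (total + self.alpha * vocab_size)
                    )
            scores[label] = score
        return scores

    def predict(self, tokens):
        scores = self.log_scores(tokens)
        return max(scores, key=scores.get)


# Hypothetical initial mapping: tokens taken from identifiers plus
# dependency tokens (the "dep:" encoding is an assumption of this sketch).
training = [
    (["order", "view", "render", "dep:model"], "gui"),
    (["login", "dialog", "button", "dep:model"], "gui"),
    (["order", "entity", "price", "dep:storage"], "model"),
    (["customer", "entity", "address", "dep:storage"], "model"),
    (["sql", "connection", "query"], "storage"),
    (["file", "cache", "query"], "storage"),
]
clf = MultinomialNB().fit([t for t, _ in training], [m for _, m in training])

unmapped = ["invoice", "dialog", "render", "dep:model"]
print(clf.predict(unmapped))
```

In this sketch both the semantic tokens and the dependency tokens end up in the same bag, so a single classifier weighs both kinds of evidence; how the features are actually built and weighted in the paper is not stated in the abstract.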

Similar resources

Automatically Adapting Source Code to Document Provenance

Being able to ask questions about the provenance of some data requires documentation on each influence on that data’s existence and content. Much software exists, and is being developed, for which there is no provenance-awareness, i.e. at best, the data it outputs can be connected to its inputs, but with no record of intermediate processing. Further, where some record of processing does exist, ...

Naive Bayes and Map-Reduce

We’ll start out with a very simple learning algorithm: multinomial Naive Bayes. Our implementation is in Table 1. Each training example is a labeled document d = (i, y, (w1, . . . , wni)) with an identifier i, a label y from a small set Y = {y1, . . . , yK}, and a “bag of words”. The bag of words are wj’s, encoded here as a list of strings, so that wj is the word/token at position j of document...

Spam Filtering with Naive Bayes - Which Naive Bayes?

Naive Bayes is very popular in commercial and open-source anti-spam e-mail filters. There are, however, several forms of Naive Bayes, something the anti-spam literature does not always acknowledge. We discuss five different versions of Naive Bayes, and compare them on six new, non-encoded datasets, that contain ham messages of particular Enron users and fresh spam messages. The new datasets, wh...

Automatically Generating Counterexamples to Naive Free Theorems

Disproof can be as important as proof in studying programs and programming languages. In particular, a somehow conditioned statement about program behavior is sometimes best understood and explored by trying to exhibit a falsifying example in the absence of a condition in question. Automation is as desirable for such falsification as it is for verification tasks. We develop formal and implement...

Journal

Journal title: Journal of Systems and Software

Year: 2022

ISSN: 0164-1212, 1873-1228

DOI: https://doi.org/10.1016/j.jss.2021.111095